No matter the expressive power and sophistication of supervised learning algorithms, their effectiveness is restricted by the features describing the data. This is not a new insight in ML, and many methods for feature selection, transformation, and construction have been developed. But while work on general techniques for feature selection and transformation, i.e. dimensionality reduction, is on-going, work on feature construction, i.e. enriching the data, is by now mainly the domain of image recognition, particularly character recognition, and NLP. In this work, we propose a new general framework for feature construction. Class outliers indicate the need for feature construction in a data set, and discriminative pattern mining on their k-neighborhoods is used to derive new features. We instantiate the framework with LOF and C4.5-Rules, and evaluate the usefulness of the derived features on a diverse collection of UCI data sets. The derived features are more often useful than ones derived by DC-Fringe, and our approach is much less likely to overfit. But while a weak learner, Naive Bayes, benefits strongly from the feature construction, the effect is less pronounced for C4.5, and almost vanishes for an SVM learner.

Keywords: feature construction, classification, outlier detection
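To make the described pipeline concrete, the following is a minimal sketch under stated assumptions: scikit-learn's LocalOutlierFactor stands in for LOF, and a shallow DecisionTreeClassifier stands in for C4.5-Rules (scikit-learn ships no C4.5 implementation); the function name `derive_features` and the feature encoding are illustrative, not the paper's exact procedure.

```python
import numpy as np
from sklearn.neighbors import LocalOutlierFactor, NearestNeighbors
from sklearn.tree import DecisionTreeClassifier

def derive_features(X, y, k=20):
    """Sketch of the framework: find class outliers, then mine
    discriminative patterns on their k-neighborhoods as new features."""
    new_cols = []
    nn = NearestNeighbors(n_neighbors=k).fit(X)
    for label in np.unique(y):
        mask = (y == label)
        if mask.sum() < 3:
            continue
        # 1. Flag class outliers: points of this class that LOF marks as
        #    outlying with respect to the rest of their own class.
        lof = LocalOutlierFactor(n_neighbors=min(k, mask.sum() - 1))
        flags = lof.fit_predict(X[mask])               # -1 marks an outlier
        outliers = np.where(mask)[0][flags == -1]
        # 2. For each outlier, mine a discriminative pattern on its
        #    k-neighborhood (over all classes) with a shallow tree.
        for idx in outliers:
            _, nbrs = nn.kneighbors(X[idx:idx + 1])
            nbrs = nbrs.ravel()
            if len(np.unique(y[nbrs])) < 2:
                continue                                # nothing to discriminate
            tree = DecisionTreeClassifier(max_depth=2).fit(X[nbrs], y[nbrs])
            # 3. Whether the pattern fires on each instance becomes a new
            #    binary feature appended to the original representation.
            new_cols.append((tree.predict(X) == label).astype(int))
    return np.column_stack(new_cols) if new_cols else np.empty((len(X), 0))
```

The enriched representation would then be `np.hstack([X, derive_features(X, y)])`, to be passed to a downstream learner such as Naive Bayes, C4.5, or an SVM.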